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ABSTRACT 

While it is well-known that "biased galaxy formation" can increase the strength of 
galaxy clustering, it is less clear whether straightforward biasing schemes can change 
the shape of the galaxy correlation function on large scales. Here we consider "local" 
biasing models, in which the galaxy density field 6g at a point x is a function of the 
matter density field 6 at that point: 5g = f{d). We consider both deterministic biasing, 
where / is simply a function, and stochastic biasing, in which the galaxy density Sg is 
a random variable whose distribution depends on the matter density: 6g = X{6). We 
show that even when this mapping is performed on a highly nonlinear density field with 
a hierarchical correlation structure, the correlation function ^ is simply scaled up by 
a constant, as long as ^ ^ 1. In stochastic biasing models, the galaxy autocorrelation 
function behaves exactly as in deterministic models, with X(6) (the mean value of X 
for a given value of 6) taking the role of the deterministic bias function. We extend our 
results to the power spectrum P{k), showing that for sufficiently small k, the effect of 
local biasing is equivalent to the multiplication of P{k) by a constant, with the addition 
of a constant term. If a cosmological model predicts a large-scale mass correlation 
function in conflict with the shape of the observed galaxy correlation function, then 
the model cannot be rescued by appealing to a complicated but local relation between 
galaxies and mass. 

Subject headings: galaxies: clustering, large-scale structure of universe 



1. Introduction 

If galaxies form with greater efficiency (per unit mass) in high density regions, then their 
clustering can be amplified with respect to that of the underlying mass distribution (jKaiser 1984) 



This amplification is often summarized in terms of a "bias factor" b, where b"^ = Cg{r)/S,{r) is the 
ratio of the galaxy autocorrelation function to the mass autocorrelation function. Biased galaxy 
formation plays a crucial role in cosmological scenarios that assume a critical density (il = 1) 
universe, since these models predict excessively high velocity dispersions in galaxy groups and 
clusters unless the amplitude of mass correlations is lower than the observed amplitude of galaxy 
correlations ( pavis et al. 1985 ; Bardeen et al. 1986 ). At first glance, it appears obvious that 



bias can alter the shape of the autocorrelation function in addition to changing the amplitude, 
since one can simply write the bias factor & as a bias function b{r). However, a physical theory 
of biased galaxy formation cannot specify b(r) directly, it can only specify how the efficiency of 
galaxy formation depends on environment — b{r) is an output of such a theory, not an input. For 
example, the widely examined, "high peak" model of galaxy formation predicts a scale-independent 
bias factor, at least in the linear regime ( pardeen et al. 1986| ). 

The possibility of scale-dependent bias became a serious issue once it was shown that the 
shape of the galaxy autocorrelation function differed from the shape predicted by the "standard" 
cold dark matter (CDM) model, on large scales close to the linear regime ( [Maddox et al. 1990| ). 
With scale-dependent bias, one could in principle resolve this discrepancy by appealing to the 
complex astrophysics of galaxy formation instead of altering the CDM model's fundamental 
cosmological assumptions (e.g., the value of 17). However, the specific schemes that have been 
proposed to achieve the requisite scale-dependence are all non-local; the efficiency of galaxy 



formation is directly modulated in a coherent fashion over large scales (Babul & White 1991 



Bower et al. 1993). While this sort of coherent modulation is physically possible, it seems a priori 



less natural than models in which the efficiency of galaxy formation depends only on properties 
of the local environment. Weinberg (1995) and Mann, Peacock, &; Heavens (1997) applied a 
wide range of local biasing schemes to cosmological N-body simulations, and they found that 



these schemes did not change the shape of the galaxy autocorrelation function or of its Fourier 
transform, the power spectrum, on large scales, though they did alter the shape in the nonlinear 
regime. 

Is non-locality essential to producing scale-dependent bias on large scales? In this paper we 
address this question analytically, extending results from earlier work. Coles (1993) showed that an 
arbitrary local bias applied to a Gaussian density field amplifies (or depresses) the autocorrelation 
function by a constant multiplicative factor. His argument works for Gaussian fields even when 
the rms fluctuations are nonlinear, but in the real universe the nonlinear density field cannot be 
Gaussian because densities cannot be negative. In practice, the efficiency of galaxy formation may 
depend on the mass density averaged over some fairly small, nonlinear scale, and there will almost 
certainly be scatter about the mean relation between galaxy and mass densities because of the 
influence of a variety of physical effects. 

In a seminal paper, Fry &: Gaztahaga (1993, hereafter FG) examined biasing schemes in 
which the galaxy density is an arbitrary function of the local mass density. FG expand the 
biasing function in a Taylor series, and they show that if the cumulants of the mass density field 
exhibit hierarchical relations, then the cumulants of the locally biased galaxy density field also 
exhibit hierarchical relations in the limit that (5^) ^ 1. FG examined only one-point distribution 
functions, but their approach can be generalized to deal with correlation functions at non-zero 
separation [see, e.g.. Fry (1994) for a discussion of the three-point function]. 

The arguments in §§2 and 3 below extend the FG results in two ways. First, we show that 
if the mass clustering follows a hierarchical pattern, then local bias multiplies the autocorrelation 
function by a constant factor on large scales (where .^ ^ 1), even if the bias is applied on a scale 
where the density field is nonlinear. We then show that this result carries over to stochastic local 
biasing, in which the galaxy density is a random variable whose mean value is a local function of 
the matter density. In §4 we show how our results for the galaxy autocorrelation function translate 
into results for the power spectrum. We summarize our conclusions in §5. 



2. Deterministic Local Bias 

A general form of deterministic focal bias relates the density fluctuation field of the galaxies, 
6g, to the density fluctuation field of the matter, 5, at the same point x through an arbitrary 
function /: 

<5,(x) = /[<5(x)]. (1) 

[We use quantities without subscripts, such as 5 and ^, to refer to the underlying matter 
distribution, and subscripted quantities like 6g and ^g to refer to the biased distribution of 
galaxies.] Although equation (|^) represents the most general form of local bias in which 6g is a 
function only of 6, one could imagine more general local functions in which 6g is also a function 
of, for example, the local velocity field or derivatives of the local gravitational potential. Implicit 
in equation ([l|) is a smoothing scale on which the continuous fields (5(x) and Sg{x.) are defined. 
Physically, this scale indicates the range over which the environment directly influences the 
efficiency of galaxy formation. In a random field with significant long wavelength power, the 
local density contrast is itself correlated with the density contrast on larger scales, and it is 
this correlation that allows a local transformation to amplify ^{r) by a constant factor on large 
scales (Kaiser 1984). However, the Bower et al. (1993) model for scale-dependent bias effectively 
incorporates an "influence" scale of lO's of Mpc, implying that a forming galaxy is "aware" of the 
physical conditions far away. 

Coles (1993) and FG have demonstrated a number of important properties of biasing models 
defined by equation (|l|). For the case where 5 is a Gaussian field. Coles (1993) shows that 
^g(r) oc S,{r) on all scales where ^(r) ^ 1 for almost any choice of the function /. In other words, 
arbitrary local biasing of a Gaussian density field does not alter the shape of the autocorrelation 
function on large scales. Coles also notes that his argument fails for some simple, albeit physically 
unlikely functions, such as 6g = 6'^ — (5^). FG expand the function / in a Taylor series: 

oo 

k\' 



m = E ?^^'> (2) 
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where ^q is chosen to give {5g) = 0. They then derive the cumulants of 6g in terms of the cuniulants 
of 6 and the biasing coefficients b^, in the hniit that (5^) ^ 1. In this hmit, it is obvious from 
equation (g) that the leading-order effect on the variance is a^ = bla"^; in FG's notation 

^g,2 = bl'^2 + 0{f2), (3) 

where ^2 = i^'^) = c^^ (see FG, equation [9]). 

With the FG expansion (equation |2|), the galaxy autocorrelation function can be written 

eg(Xl,X2) = (<5g(Xl)5g(X2)), (4) 

OD r T 

If the smoothing scale on which S{x.) is defined is large enough, then {6'^) <^ 1, and only the 
j = k = 1 term survives, implying that 

e,(xi,x2) = 6?e(xi,x2) + c:)(c2)^ (g) 

In other words, if there is a deterministic local relation between galaxy density and mass density on 
some scale in the linear regime, then the autocorrelation function in the linear regime is multiplied 
by a scale-independent factor b^, where 61 is the first derivative of the local bias function f{6) 
evaluated at 6 = 0. This argument is a trivial extension of the one-point argument for the variance 
given by FG, analogous to Fry's (1994) extension of the FG skewness result to the three-point 
correlation function. 

What if the density field is nonlinear on the scale where local bias operates, so that (5^) ^ 1? 
This situation is physically plausible, and we are no longer free to discard the higher-order terms 
in the sum in equation (g). We can still make progress if we introduce the assumption that the 
clustering is hierarchical, i.e., the connected part of ((5(xi)''5(x2) ) is given by (Peebles 1980; Fry 
1984; Bernardeau 1996) 

{6{^iy6{^2)')c = C,- fc(52)^-+^-2(5(xi)<5(x2)) + 0{e). (7) 

Although the assumption of hierarchical clustering can only be shown to be rigorously valid in the 
quasilinear regime, numerical simulations show that it holds to a fairly good approximation even in 
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the nonlinear regime (Colonibi, Bouchet, & Schaeffer, 1994; Colombi, Bouchet, & Hernquist 1996; 
see Suto &: Matsubara 1994 for the opposing point of view), and there are theoretical grounds 
for believing that hierarchical clustering should apply in the nonlinear regime (Davis & Peebles 
1977; Peebles 1980; Balian & Schaeffer 1989). There is also support for hierarchical clustering in 
the observed galaxy distribution (see, for example, Szapudi, et al. 1995), but this is not directly 
relevant to our argument, since we are interested in the dark matter clustering hierarchy, which 
cannot be observed directly. The validity of equation (|^) for our evolved density field is the key 
assumption we make in this section; it allows us to generalize equation (0) to the biasing of 
nonlinear fields. Bernardeau (1996) begins with equation (0) and derives a gravitationally induced 
"bias" , but this differs from the arbitrary bias functions we are dealing with here. 

With the hierarchical assumption, we can write 

oo , , 
e3(xi,X2) = Yl -^[CM^y^'''^i^l^^2) + 0{e) + {K^iyK^2f)unconnected]. (8) 

The first two terms arise from the connected part of ((5(xi)-'(5(x2 )*"'), while the last term is 
the unconnected part. Note, however, that this unconnected part can be written as powers 
of lower-order correlations, which can themselves all be expanded out according to equation 
(|^. [There are no terms of zero-th order in ^ arising from the unconnected terms of the form 
(5(xi)-')((5(x2) ), because all such terms are cancelled by other terms included in 60 •] In the end, 
we obtain: 



Cg(xi,X2) 



bjh,2^j+k-2 



jk ■?•^• 



e(xi,X2) + 0(C2^, (9) 



where Kji^. is a set of constants. Hence, we find that for .^ ^ 1, the quantity 6^ = S,g{r)/6.{f) is 
approximately constant. Again, we wish to emphasize that we have assumed nothing about the 
linearity of the density field at the scale of biasing; all we have assumed is the validity of equation 

Both the Coles (1993) result for Gaussian initial conditions and the FG result (equation g) 
are special cases of our general result. If the underlying mass density field is Gaussian, as in the 
case discussed by Coles (1993), then the density field is hierarchical in the sense that equation (^ 



is satisified, but all of the hierarchical coefficients Cj^k vanish except for Ci^i, which is unity. Then 



our conditions are satisfied, and ^„(xi,X2) = 6^^(xi,X2). Formally, the Coles result holds even for 



the case ((5^) > 1, but this is not a physically realistic case, since the density field will be Gaussian 
only for (d^) < f . 

To obtain the FG result, we simply take cr^ <C 1 in equation (^). Then the j = k = 1 term 
dominates, and we re-obtain equation (P). This equation differs from our more general result in 
that if the local bias is applied on a nonlinear scale, then all of the Taylor series coefficients of the 
bias function contribute to determining the bias factor on large scales, not just 6i. 

Our argument for scale-independent bias fails when ^ becomes larger than unity, which is 
a good thing, since local bias can change the shape of the autocorrelation function and power 
spectrum in this regime (Weinberg 1995; Mann et al. 1997| ). Note, however, that our argument 



does hold even for the case of quadratic biasing, f{6) = 6'^ — {6'^). The reason that the Coles (1993) 
argument fails in this case is that a Gaussian density field has no connected higher moments, so 
terms linear in ^ vanish. 



3. Stochastic Local Bias 

The bias model of equation (Q) can at best be an idealization. Even in the case where galaxies 
"trace the mass," f{6) = 6, there will be Poisson fluctuations about the mean relation because of 
the discrete nature of the galaxy distribution. More generally, we expect the probability of forming 
a galaxy in a given region to depend on many factors, including the history of accretion and 
mergers in the nearby environment. Many of these factors will be correlated with the local density, 
but they will not be completely determined by it. We can quantify our ignorance by allowing for 
stochastic bias, in which the galaxy density is a random variable which depends in some way on 
the underlying matter density, but which is not completely determined by it. Little previous work 
has been done on stochastic bias models, although Pen (1997) has recently attempted to model 
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the joint galaxy-matter probability distribution function using a bivariate Gaussian as a starting 
point. 

Let us therefore assume that the galaxy density 5g at a point x is a random variable X, which 
is a function of the underlying matter density at that same point: 

5,(x)=X[5(x)]. (10) 

We again assume that X includes a constant term that gives {5g) = 0. As in the case of 
deterministic local bias, we assume some smoothing scale over which 5g and 5 are defined, so that 
the bias does not occur at a geometric point, but over some small volume. The random variable X 
is uniquely specified by the probability of producing a particular value of X given an underlying 
value of (5, which we write in the standard way as p{X\5), the probability of X given 5. Note 
that our assumption that the stochastic bias is purely local is actually very restrictive. It means, 
for example, that the distribution of the random variable X is the same at every point in space 
with the same (5, and that there are no correlations between this distribution at different points in 
space. 

The probability of measuring a galaxy density 5gi at the point xi and a galaxy density 5g2 at 
the point X2 is 

P{5gu5g2) = Jp{Xi\6i)p{X2\62)p{6i,62)dSidS2, (11) 

where we have used the "1" and "2" subscripts to denote the values of X and 5 at the points xi 
and X2 and p{5i, 62) to denote the two-point probability distribution of the matter density at these 
points. [Note that 6g = X in this equation.] For this model, the galaxy autocorrelation function is 

e3(xi,X2) = (X(X1)X(X2)), (12) 

= f XiX2P{Xi\5i)PiX2\62)piSi,62)d5id62dXidX2. (13) 

We can perform the integration over Xi and X2 to obtain 

C3(xi,X2)= (x{5i)X{52)p{5i,52)d5id52, (14) 
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where X{S) is the mean value of X for a given value of 6. This result generalizes in a 
straightforward way to all of the higher-order correlation functions. 



The argument that leads from equation (10) to equation ( |14D is almost trivial, but the result 



is rather remarkable. It shows that the calculation of the correlation function for the most general 
possible stochastic local biasing model can be reduced to the equivalent problem for a deterministic 
local bias, with X{5) taking the role of the bias function. Hence, all of the mathematical machinery 
developed here and in other papers for the problem of deterministic local bias can be used for 
stochastic bias. Thus far, we have made no assumptions about the underlying density field 5. If we 
now repeat our assumption from the previous section that 6 exhibits hierarchical clustering, then 
we obtain the same result as in the previous section: £,g/^ is constant as long as .^ <C 1. Equation 
(14) takes a particularly simple form if galaxies trace the mass on average, X{6) = S. In this case. 



we obtain simply £,g{r) = £,{r). This result tells us that the random fluctuations about the mean 
density make no difference in the final autocorrelation function. 

These results may seem counterintuitive, since stochastic bias ought to introduce some sort of 
increased "scatter" in the final density distribution, and it certainly increases the final rms density 
fluctuation. One must remember, however, that ^g represents a volume- averaged correlation 
function, within which all of the random fluctuations have been averaged out. What does change 
for the case of stochastic bias are the random fluctuations relative to ^g. The variance of the 
autocorrelation function at some fixed separation is 

al=j{Sg^Sg,fpiS,,,6g,)d5g,d5g,- 

In terms of our stochastic bias function X{5), this becomes 

a| = j Xi{6if X2{52Yp{h,52)d6id62 - Jxi{6i)X2{62)p{6i,S2)d6id62 

To illustrate the way in which o"? is increased, we consider again the simple class of models in 
which X{6) = 6, and we we use equation (16) to calculate the difference between o"? for the 
stochastic case and cr| for the deterministic case 6g = 6: 

o"^ (stochastic) — cr^ (deterministic) = / (Tx{Si)o'x{S2)p{Si,S2)dSid52- (17) 



2 

SglSg2p{Sgl,6g2)d6gld6g2 ■ 



2 



(15) 



(16) 
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Here cr'j^iS) is the variance of the distribution of X for a given value of 6, 



a\{5) = X{5Y-X{5f. (18) 

Since (t\{5) is positive, this result shows that randomness in the bias function increases the 
fluctuations about the mean value of ^g. Conceptually speaking, equations (|l5|)-([l7|) presume 
that one estimates ^g{r) from many different pairs of positions with spatial separation r (or from 
a single pair of positions in an ensemble of universes) and computes the variance o"? of these 
estimates. In practice, one must average over a large number of position-pairs in order to get an 
estimate of ^^ that is at all useful, but stochastic biasing will still act to increase the variance 
in estimates of ^g from one volume of the universe to another. These fluctuations, which can be 
measured in large redshift surveys, encode information about the degree of stochasticity in the 
galaxy formation process at fixed local mass density. (Of course, even in the absence of stochastic 
bias, the variance in ^g is nonzero). 

The rms fluctuation of a smoothed field can be written as an integral over i{r). The conclusion 
that ^g{r) = ^(r) for X{6) = 5 aX first seems to contradict the obvious fact that stochasticity 
will increase the rms fluctuations smoothed on any length scale. However, these two results are 
not contradictory. Recall that we assumed that the initial density field is smoothed over some 
scale Rs, and that local bias operates over this same smoothing scale. Our assumption that the 
distribution of X is uncorrelated at different points is invalid for separations less than Rg, which 



means that equation (11) also fails on such short separations. This is most obvious for the case of 
zero separation. If we measure the density at a single point xi, then the product of probabilities 
P{Xi\5i)P{X2\52) in equation (|l2|) must be replaced by the single probability P{Xi\5i), and 
equation (|l2|) becomes 



eg(xi,xi) = (X(xi)X(xi)), (19) 

= I XiXiP{Xi\5i)p{5i)d5idXi. (20) 

Integrating over Xi gives 

^g{^i,^i)= JY{Sfp{5)d5. (21) 
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For the deterministic case where Sg = 6, the corresponding quantity is 

U^u^i) = j 5^p{5)d5. (22) 

For the special case where X{5) = 5, we have: 

^3 (stochastic) - ^g (deterministic) = j \X(J^ - X{5f]p{5)d5. (23) 



But X{5Y — X{5)'^ > for all values of 5, so ^^ (stochastic) — ^^ (deterministic) > 0. Thus, 
stochastic bias increases the rms fluctuations, but the entire effect is due to the change in ^g{r) 
at separations smaller than our initial smoothing length; at these length scales our arguments 
regarding the effects of stochastic bias on the autocorrelation function do not apply. 



4. The Pow^er Spectrum 

Although we have focused so far on the autocorrelation function, many observational studies 
of large scale structure use its Fourier transform, the power spectrum, to quantify clustering on 
the largest scales. The Mann et al. (1997) numerical study of local biasing focuses mainly on the 
power spectrum. The mass power spectrum P{k) is related to the mass autocorrelation function 

i{r) by 

P{k)=4TT f ^{rf-^^^^r^dr, (24) 

J kr 

and the galaxy power spectrum is 

Pgik) = Att J ^girf-^^r'dr. (25) 

In §2, we showed that deterministic local bias applied to a hierarchically clustered density field 
gives Cg{f) = b'^S,{r) for ^(r) <^ 1, but we can put no constraint on the bias for ^(r) ^ 1. Let rg be 
a distance such that ^(r) <^ 1 when r > rQ. We can therefore write 

C,(r)=62^(r)+e(r), (26) 
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where ^(r) = for r > tq. Substituting this equation into equations (^) and (p5[), we get 

pJk) = b'^P{k) + 47r r g(r) ''™/^''V ^dr. (27) 

Jo Kr 

If we choose a fixed value of k for which kr^ <C 1, then A;r ^ 1 over the entire range of integration 
in the second term, so this integral just reduces to An /q" ^{r)r'^dr, which is a constant, independent 
of k. Thus, in the regime k <^ 1/ro, 

Pg{k)=b^P{k)+C, (28) 

where b is the large scale bias factor of the autocorrelation function and c is a constant, which 
may be positive or negative. This is just a more rigorous way of noting that the power spectrum 
for k < ko is dominated by the correlation function at r > I/Zcq, though small scale fluctuations 
can add a constant offset to P{k). Equation ( [2^ ) is not quite the same as a scale-independent 
amplification of P{k). However, the power spectrum estimated by Baugh & Efstathiou (1993) 
from the APM survey continues to rise out to 2Tr/k ^ 130/i^^ Mpc, so in realistic models the 
constant c is likely to become unimportant on large scales, at least until one reaches the turnover 
in the power spectrum. 



5. Conclusions 

We have shown that for a local bias function applied to a density field with a hierarchical 
correlation structure, the only effect is to rescale the autocorrelation function by an overall bias 
factor 6^ on length scales for which S,{r) <C 1; no change in the shape of the autocorrelation 
function can be induced by such a local transformation. For the power spectrum, for sufficiently 
small k, the result is also a rescaling, with the possible addition of a constant term. Although we 
have assumed hierarchical clustering, our result holds as long as 

(5(xi)^,5(x2)'=) = Z),- fc(5(xi)<5(x2)) + O(C'), (29) 
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where Dj^ is independent of the separation between xi and X2. Equation (|29| ) is actually a slightly 
weaker condition than the assumption of hierarchical clustering (equation ^) because the moment 
on the left-hand side of equation ([29| ) is not connected. 



If there is a bias between galaxies and mass (and the galaxy morphology-density relation 
implies that there must be bias for at least some kinds of galaxies) , then the physics that causes it 
may well be complex. However, our stochastic biasing result implies that all environmental effects 
on the efficiency of galaxy formation influence ig{r) only to the extent that they are correlated 
with the mass density itself, and if these effects are local, then they still will not change the shape 
of the autocorrelation function on scales in the linear regime. Cen &: Ostriker (1992, figure 4) 
presented a first attempt to calculate the full distribution function P{6g\6), using a hydrodynamic 
cosmological simulation of the standard cold dark matter model. We can expect substantial 
progress from this a priori approach to biased galaxy formation over the next few years, since 
advances in computer power and algorithms now allow simulations of much higher dynamic range 
and permit broader explorations of cosmological parameter space. However, our results imply 
that all of these calculations should produce galaxy populations with ^g{r) oc ^(r) on large scales. 
Only a biasing mechanism that coherently modulates galaxy luminosities on scales larger than 
those over which the matter actually moves, e.g., suppression or enhancement of star formation by 
quasar radiation (Babul & White 1991; Bower et al. 199^), can rescue a cosmological model that 



predicts the wrong shape for ^(r) on the scales where ^(r) <C 1. Since a physical mechanism of this 
sort would surely have a different impact on galaxies of different luminosities and morphological 
types, the giant redshift surveys becoming available in the next few years will allow us to test 
whether non-local biasing occurred in the real universe by comparing the large-scale correlation 
functions of different classes of galaxies. 
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